Improving XPath Evaluation with Strategies

Authors

  • Jeffrey Palm
  • Karl J. Lieberherr
Abstract

XPath is the de facto navigation language for XML documents conforming to a DTD, and it is important to have efficient evaluation and checking techniques available for XPath. Traversal strategies, a component of Adaptive Programming, are a well-studied navigation language for objects conforming to class graphs, and an efficient evaluation technique for them was developed in the previous millennium. Traversal strategies do not deal with the full generality of XPath; instead they focus on ancestor-descendant (ad) edges. Indeed, a traversal strategy is a graph consisting only of ad edges with optional negative constraints on the edges.

While it has been widely acknowledged in the database community that DTDs may speed up XPath evaluation, no paper has shown an exponential improvement from using them. We show an infinite sequence of XPath expression/DTD/document triples for which evaluation using the meta-information in the DTDs is exponentially faster. We draw a strong connection between traversal strategy evaluation and XPath evaluation and show how this leads to comprehensive logical optimization techniques for XPath. Our evaluation is optimal in the sense that visiting even one node fewer than our approach visits would lead to wrong results. Finally, we present empirical results on our speedup and explore various classes of XML documents that can benefit from this type of evaluation. We believe that the automata-theoretic algorithmic ideas presented in this paper should be an important building block of any efficient XPath evaluator.

General Terms: Logical optimization for schema-based XPath evaluation, optimal XPath evaluation, Adaptive Programming, metainformation-guided search.

March 2, 2005
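The idea of using DTD meta-information to avoid visiting parts of a document can be illustrated with a small sketch. This is not the paper's automata-theoretic algorithm, and it does not reproduce the exponential separation claimed in the abstract. It is a simplified illustration for a single descendant query (roughly `//title`), and it assumes the DTD is given as a mapping from each element type to its allowed child types. All names (`Node`, `can_reach`, `find_naive`, `find_guided`, and the toy `dtd` and `doc`) are hypothetical.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class Node:
    tag: str
    children: list[Node] = field(default_factory=list)


def can_reach(dtd: dict[str, set[str]], target: str) -> set[str]:
    """Element types from which `target` is reachable in the DTD graph
    (including `target` itself), computed as a simple fixed point."""
    reach = {target}
    changed = True
    while changed:
        changed = False
        for parent, kids in dtd.items():
            if parent not in reach and kids & reach:
                reach.add(parent)
                changed = True
    return reach


def find_naive(root: Node, target: str) -> tuple[list[Node], int]:
    """Visit every node; return the matches and the number of nodes visited."""
    hits, visited, stack = [], 0, [root]
    while stack:
        node = stack.pop()
        visited += 1
        if node.tag == target:
            hits.append(node)
        stack.extend(node.children)
    return hits, visited


def find_guided(root: Node, target: str, dtd: dict[str, set[str]]) -> tuple[list[Node], int]:
    """Same query, but skip subtrees whose root type cannot lead to `target`
    according to the DTD."""
    useful = can_reach(dtd, target)
    hits, visited, stack = [], 0, [root]
    while stack:
        node = stack.pop()
        visited += 1
        if node.tag == target:
            hits.append(node)
        stack.extend(c for c in node.children if c.tag in useful)
    return hits, visited


# Toy DTD (assumed for illustration): only `book` subtrees can contain `title`.
dtd = {
    "library": {"book", "archive"},
    "book": {"title"},
    "archive": {"box"},
    "box": {"box", "paper"},
    "title": set(),
    "paper": set(),
}

# Toy document conforming to the DTD above.
doc = Node("library", [
    Node("book", [Node("title")]),
    Node("book", [Node("title")]),
    Node("archive", [Node("box", [Node("box", [Node("paper")]), Node("paper")])]),
])

naive_hits, naive_visited = find_naive(doc, "title")
guided_hits, guided_visited = find_guided(doc, "title", dtd)
```

On this toy document both searches find the same two `title` nodes, but the guided search never enters the `archive` subtree, so it visits 5 nodes where the naive search visits 10. The saving grows with the size of the subtrees that the DTD rules out.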

Similar resources

XPath Query Evaluation: Improving Time and Space Efficiency

Contemporary XPath query engines evaluate queries in time exponential in the sizes of input queries, a fact that has gone unnoticed for a long time. Recently, the first main-memory evaluation algorithm for XPath 1.0 with polynomial time combined complexity, i.e., which runs in polynomial time both with respect to the size of the data and the queries, has been published (cf. [11]). In this paper,...

A Logic-Based Approach to Cache Answerability for XPath Queries

We extend a recently proposed model checking-based algorithm for the evaluation of XPath queries with a cache strategy to store the results of the (most frequently) asked queries and to re-use them at occurrence. We experimentally show that, as soon as the cache is warm, the proposed optimization is quite effective. We complement our proposal with a broad experimental comparison of different st...

A Study of Positive XPath with Parent/Child Navigation

We study the expressiveness of Positive XPath with parent/child navigation, denoted XPath, from two angles. First, we establish that XPath is equivalent in expressive power to some of its sub-fragments as well as to the class of tree queries, a sub-class of the first-order conjunctive queries defined over label, parent, and child predicates. The translation algorithm from tree queries to XPath ...

Optimal Buffering for Streaming XPath Evaluation

We motivate and present a definition of optimal buffering for streaming evaluation of XPath queries. We consider a large fragment of XPath that includes multiple (correlated) subqueries and reverse (up the document tree) axes. We describe a method for XPath evaluation with optimal buffering. We present the results of an experimental evaluation of our methods based on our implementation, which i...

Trie Indexes for Efficient XML Query Evaluation

As the number of applications that rely on XML data increases, so does the need for performing efficient XML query evaluation. A critical part of the solution involves providing new techniques for designing XML indexes and lookup algorithms. In this paper, we leverage the results of our research on coupling the partitions induced by fragments of XPath algebra and those induced by the structural...

Publication date: 2005